PyDigger - unearthing stuff about Python


NameVersionSummarydate
trl 0.13.0 Train transformer language models with reinforcement learning. 2024-12-16 15:14:51
trl-fpo 0.0.10 Train transformer language models with reinforcement learning. 2024-12-14 16:44:48
nemo-aligner 0.5.0 NeMo-Aligner - a toolkit for model alignment 2024-11-14 23:55:58
shtec-rlhf 1.0.5 shtec-rlhf: Safe Reinforcement Learning from Human Feedback 2024-06-24 05:55:07
hourdayweektotal
2613416099275524
Elapsed time: 1.57781s